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Abstract 

The input of the Test Cover problem consists of a set V of vertices, and a collection 
£ = {Ex,... ,E m } of distinct subsets of V, called tests. A test E q separates a pair Vi,Vj 
of vertices if \{vi, Vj} n E q \ = 1. A subcollection T C £ is a test cover if each pair Vi,Vj of 
distinct vertices is separated by a test in T ■ The objective is to find a test cover of minimum 
cardinality, if one exists. This problem is NP-hard. 

We consider two parameterizations the Test Cover problem with parameter k: (a) decide 
whether there is a test cover with at most k tests, (b) decide whether there is a test cover with 
at most |V| — k tests. Both parameterizations are known to be fixed-parameter tractable. We 
prove that none have a polynomial size kernel unless NP C coN P/ poly . Our proofs use the 
cross-composition method recently introduced by Bodlaender et al. (2011) and parametric 
duality introduced by Chen et al. (2005). The result for the parameterization (a) was 
an open problem (private communications with Henning Fernau and Jiong Guo, Jan. -Feb. 
2012). We also show that the parameterization (a) admits a polynomial size kernel if the 
size of each test is upper-bounded by a constant. 

1 Introduction 

In the Test Cover problem defined below we are given a set V = {t>i, . . . , v n } of element called 
vertices and a family £ = {E\, . . . , E m } of distinct subsets of V called tests. We say that a test 
E q separates a pair Vi,Vj of vertices if \{vi, vj} n E q \ = 1. A subset T of £ is called a test cover 
if for every pair of distinct vertices Vi,Vj there exists a test T € T separating them. 



TestCover(p) 

Input: A set V = {v±, . . . , v n } of vertices, a family £ = {E\, . . . , E m } of tests and an 
integer p. 

Task: Decide whether there exists a test cover with at most p tests. 



The (optimization version of) Test Cover problem arises naturally in the following general 
setting of identification problems: Given a set of items (i.e., vertices) and a set of binary at- 
tributes that may or may not occur in each item, the aim is to find a minimum-size subset of 
attributes (a minimum test cover) such that each items can be uniquely identified from the infor- 
mation on the subset of attributes it contains. The Test Cover problem arises in fault analysis, 
medical diagnostics, pattern recognition, and biological identification (see, e.g., [T0 l [TT j PT2])- 

The TestCover(p) problem is NP-hard, as was shown by Garey and Johnson [8]. Moreover, 
TestCover(p) is APX-hard [10J. There is an 0(log n)-approximation algorithm for the problem 
|12] and there is no o(log n)-approximation algorithm unless P=NP [10J. 

The generic parameterized version TestCover(p, k) of TestCover(p) is defined as follows. 
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TestCover(j>, k) 

Input: A set V = {vi, . . . , v n }, a family £ = {E±, . . . , E m } of subsets of V and two 
integers p and k. 
Parameter: k. 

Task: Decide whether there exists a test cover with at most p tests. 

In this paper, we study two parameterizations of TestCover: the standard parameteriza- 
tion TestCover(A;, k) and its duaQ TESTCovER(n — k,k). Both parameterizations are known 
to be fixed-parameter tractable [5]. Fixed-parameter tractability of TestCover(/c, k) is easy 
to see using the fact that there is no test cover of size less than [logn] [10]. Indeed, if k < logn 
then the corresponding instance of TestCover(/c, k) is a No-instance, and if k > logn then 
n < 2 k and so m < 2 2 allowing us to solve TestCover(/j, k) by a brute force fixed-parameter 
algorithm. The proof in [5] that TestCover(?i — k, k) is fixed-parameter tractable is much 
harder. 

We first prove that TestCover(/c, k) does not admit a polynomial kernel unless NP C 
coNP/poly. The latter is deemed unlikely as it is known to imply a collapse of the polynomial 
hierarchy to its third level |14| . Note that our first result solves an open problem [6j [9]. Our 
proof uses the cross-composition method recently introduced by Bodlaender et al. [2]. 

We show that TestCover(£;, k) does admit a polynomial kernel if the size of each test is 
bounded by a constant. This constraint is used in some practical applications of TestCover 
|10| . We prove that TESTCovER(n — k,k) does not admit a polynomial kernel unless NP C 
coNP/poly. This result follows from our first result and a general result on nonexistence of 
polynomial kernels in dual parameterized problems also obtained in our paper. We use the 
notion of dual introduced by Chen et al. [?| as well as the cross-composition method. 

Our paper is organized as follows. In the rest of this section we will give a couple of simple, 
yet very useful, definitions on TestCover. In the next section we give basics on parameterized 
algorithms and kernelization as well as on the cross-composition method. In Section [31 we prove 
all our results. In Section 0] we pose some open problems. 

A partial test cover T' to an instance of TestCover(p, k) is a subset of 8 of cardinality at 
most p. We say that C C V is a class induced by a partial test cover T' , if C is a maximal set 
such that there does not exist a test in T' which separates two vertices of C. Notice that the 
classes induced by every partial test cover form a partition of V and every test cover induces n 
classes. For a positive integer p, let [p] = {1, 2, . . . ,p}. 

2 Fixed-Parameter Algorithms, Kernels and Cross-composition 
Method 

A parameterized problem P is a subset PC S*xN over a finite alphabet S. P is fixed-parameter 
tractable if the membership of an instance (x, k) of E* x N in P can be decided by an algorithm 
of runtime f {k)\x\ oi < l \ where / is a function of the parameter k only (such an algorithm is a 
fixed-parameter algorithm) [3l [13] . Given a parameterized problem P, a kernelization of P 
is a polynomial-time algorithm that maps an instance (x,k) to an instance (x',k') (the kernel) 
such that (i) (x,k) G P if and only if (x',k') £ P, (ii) k' < h(k), and (iii) \x'\ < g(k) for some 
functions h and g. The function g(k) is called the size of the kernel. It is well-known [3j [13] 
that a decidable parameterized problem P is fixed-parameter tractable if and only if it has a 
kernel. Polynomial-size kernels are of main interest, due to applications [31 [71 [13], but unfor- 
tunately many fixed-parameter tractable problems have no such kernels unless coNPCNP/poly, 
see, e.g., [I]. 

The following two definitions and Theorem [1] were given by Bodlaender et al. [2]. 
1 See Definition 0] for a formal notion of parameterized duality. 



2 



Definition 1 (Polynomial equivalence relation). An equivalence relation 1Z on X* is called a 
polynomial equivalence relation if the following two conditions hold: 

• There is an algorithm that given two strings x, y G X* decides whether x and y belong to 
the same equivalence class in (|x| + time. 

• For any finite set S C X* the equivalence relation 1Z partitions the elements of S into at 
most (max xS 5 |as | equivalence classes. 

Definition 2 (Cross-composition). Let L C X* be a problem and let Q C X* x N be a pa- 
rameterized problem. We say that L cross- composes into Q if there is a polynomial equivalence 
relation 7£ and an algorithm which, given t strings xi,...,xt belonging to the same equivalence 
class of 1Z, computes an instance (x* , k*) G X* x N in time polynomial in Xa=i \ x i\ sucri that: 

• {x* , k*) G Q if and only if X{ G L for some 1 < i < t. 

• fc* is bounded by a polynomial in max* =1 \xi\ + logt. 

Theorem 1. If some problem L is NP-hard under Karp reductions and L cross-composes into 
the parameterized problem Q then there is no polynomial kernel for Q unless NP C coNP/poly. 

3 Results 

Theorem 2. TestCover(k,k) does not admit a polynomial kernel, unless NP C coNP/poly. 

Proof. We will use Theorem [1] to show the result, hence we need an ./VP-hard problem L which 
cross-composes into TestCover(fc, k). For this purpose, it is possible to use TestCover(p). An 
instance of this problem is a triple (V,£,p). We say that two triples (V\,£i,pi) and (^,^2,^2) 
are equivalent if \V\ \ = | V2 1 and p\ = P2- 

It is not difficult to see that this defines a polynomial equivalence relation on TestCover(p) 
(see Definition [1]). To see that the second condition of Definition [1] holds, observe that the 
relation thus defined partitions a finite set S into at most 0{n-m) equivalence classes, assuming 
that for every (V,£,p) G S we have that \V\ < n and \£\ < m (since p < \£\). 

Consider now t instances Qi, ■ ■ ■ ,Qt of TestCover(p), belonging to the same equivalence 
class. For every i G [t], let V = {v±, . . . , v n } be the set of vertices of Qi, £{ = {E\, . . . , E^.} 
its set of tests and p be the upper bound on the size of the solution. Let I = 2[(logt)/2]. We 
will construct an instance Q* of TestCover(/c, k) such that Q* has a solution of size at most 
k = 21 + p if and only if at least one of Qi, . . . , Qt has a solution of size at most p. This will be 
enough to prove our result (see Definition [2] and Theorem [T]) . 

The vertex set of Q* is defined in the following way: 

V(Q*) = {v 1 ,...,v n }U (uf =1 { yj ,x{, . . . ,4» U {01, . . . ,ai}. 

The tests in Q* are defined as follows. Consider some i G [t] and let . . . ,i[) be its binary 
representation (e.g. if i = 11 and I = 6, ■ ■ ■ ,ii) = (1, 1, 0, 1, 0, 0)). For every i, consider the 
following family of sets Si (where subscripts are taken modulo k): 

Si = ^{x h ,x h+il ,x\,x h+i2 ,x\,x^ l+i3 , . . . ,x h ,%h+i t } '■ h = 1> • • • ) Pi- 
Define £(Q*) = £*U£ 1 U---U£ t , where 

c* _ / J" 2 i-! 2?- 1 2?- 1 2j— l-i / 2j 2j 2j 2j\ . ■ c ml 

c — \\Cij, y2j-\, x^ ,x 2 ,x 3 , . . . , x p 1, \aj, y2j, x^ , x 2 , x 3 . . . ,x p j . j t [tj j 

and 

£ i = {(SUE i j ):je[m i ],SeS i }. 
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This completes the definition of Q*. 

Assume that there is a solution to Q* using at most 21 + p tests. Since a\ and yn-\ need to 
be separated and similarly for a, and yn for all i, all 21 tests in £* must be used (because they 
are the only tests where Gtj, y<a-\ arid yn appear); notice also that they are enough to separate 
them. Moreover, as every set in fj (for all i) only intersects X{ = {x\,x l 2 , . . . , x~} in one vertex 
and as all vertices in Xi_ have to be separated from yi, the tests not in £* all have to contain a 
distinct vertex from Xi q 

Consider now a test E in £ = {Jj =1 £j. Its intersection with Xi U Xi+i, for every % € 
{1, 3, ... ,21 — 1}, contains only vertices which are in 5 for some S € Uj=i 

Suppose S £ 5 r for some r (this ensures that £ £ r ) and suppose that it is equal to 

where (ri, . . . , r{) is the binary representation of r and h £ [p]. 

The intersection with U Xi+\ is given by the black vertices in next figure if r*+i = 0: 

2 

X\ X\ x{ X h+l X P~1 4 

o o ••• • o ••• o o 



.7' 



o 

i+1 



If, on the contrary, n+i = 1, the intersection is given by the black vertices in next figure: 



o 



o 



o 



~i+l 



,i+l ™t+l 



„i+l 



.7' 



o 

i+1 



These are the only possible types of intersection. 

Notice that, if E is part of the solution, in order to cover all vertices in Xi U X^\ (for every 
%) it is possible to use only tests in £ r . Otherwise, if a test in £ s is used (with r ^ s), there 
exists i* such that ?> ^ Si* and it is impossible to cover all vertices in Xi* U Xi* + \. Therefore, 
if there is a solution to Q* using 21 + p tests, then 21 tests are taken from £* and p tests from 
E r for some r. This ensures that Q r has a solution using at most p tests. 

To prove the opposite direction, assume that Q r has a solution of size at most p. Using all 
sets in £* and carefully choosing p sets from £ r H, it is possible to obtain a solution to Q* with 
at most 21 + p tests. This completes the proof. □ 

Definition 3. The generic parameterized problem Test-r-Cover(p, k) is defined as TestCover(p, k), 
except that for an instance (V,£,p,k) we have that r < \V\ and \E\ < r for every E £ £. 

As the next theorem shows, this additional condition enables us to obtain a kernel which is 
linear in the number of vertices in the case of Test-r-Cover(A:, k). 

2 Notice that this ensures that a solution to Q* has at least 21 + p vertices. Therefore we are actually showing 
that Q* has a solution of size exactly 4 [(log t) /2] + p if and only if at least one of Qi, . . . , Qt has a solution of 
size at most p. 

3 The p tests from £ r have to contain the tests used in a solution to Q r and tests S which are chosen accordingly 
to the binary representation of r, in order to cover all vertices in (J^Li 
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Theorem 3. Test-r-Cover(k, k) admits a kernel with at most r ■ k — (|_logrJ — 1) ■ r vertices. 



Proof. Notice that s tests can create at most 2 s classes: more specifically, adding a test to a 
partial test cover which induces t classes produces a partial test cover with at most 2t classes. 
This happens because in order to produce a new class, the test must contain a subset C of 
a class C such that C is not empty and it does not coincide with C and if this happens for 
every class, the number of classes doubles. At the same time, since every test contains at most 
r vertices, this can happen with at most r different classes. Therefore, given a partial test cover 
which induces t classes, adding a new test can create at most min{i, r} new classes. Using k 
tests it is possible to create at most 2L logr J + (k — [logrJ)r classes, since the first [log tests 
can create at most 2 L lo s classes and every other test increases the number of classes by at most 
r. It follows that we have a No-instance unless 

n < 2'- logr J + (k — |logrJ)r < r + (k — |logrJ)r = kr — (|_logrJ — l)r. 

□ 

Corollary 1. Test-r - Cover (k,k) admits a polynomial kernel when r is a constant. 

Proof. Theorem [3] ensures the existence of a kernel with at most r ■ k — (|logrJ — 1) • r vertices. 
This means that the number m of tests is at most 

E (?) ^ E OfV e 0(r(erkY) = 0(F). 

s=l ^ ' s=l ^ ' 

□ 

Corollary 2. Consider an instance (V,£,k) of TestCover(k,k) and let r = max£ g £ \E\. If 
r G 0(n 1_e ) for some e > 0, then the problem admits a kernel which is polynomial in the 
number of vertices. 

Proof. By Theorem O n < kr. Now, for some positive constant c it holds that kr < kcn 1 ^ 6 , 
which ensures that n e < kc, that is n < (kc)^ . □ 

The proof of Theorem [2] can be modified to show the non-existence of a polynomial kernel 
for the parameterized problem TestCover(n — k, k). However, it is possible to obtain this result 
from a more general theorem. First of all, consider a definition of dual problem equivalent to 
the one given by Chen et al. [1]: 

Definition 4. Let P be a parameterized problem, i.e., P C X* x N. A mapping s : S* — > N is 
a size function for P if 

• < k < s(x) for every (x, k) € P, and 

• s(x) < \x\ for every x £ X*. 

The dual P^ of a problem P respectively to the size function s is the problem corresponding to 
the language (i.e., the set of YES-instances) Pd = {(x,s(x) — k) : (x,k) £ P}. Pd is also called 
the s-dual of P. 

Notice that the s-dual of the s-dual of a problem is again the original problem. 

Theorem 4. Given a parameterized problem P , if P admits a cross- composition from a NP-hard 
problem L, let (x,k) be the instance which is associated to the t strings x\, . . . ,xt by the cross- 
composition algorithm. If there exists a size function s(x) which is bounded by a polynomial 
in max* =1 \xi\ + logt, then the s-dual problem Pd does not admit a polynomial kernel, unless 
NP C coNP/poly. 
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Proof. We will show that if the hypothesis hold, Pd admits a cross-composition from the same 
problem L. Hence we conclude using Theorem [TJ By Definition [21 we know that there exists i 
such that Xi € L if and only if (x, k) € P. By definition of s-dual problem, this is equivalent 
to (x,s(x) — k) £ Pd- This provides a cross-composition for P^, as long as the parameter 
k! = s(x) — k is a polynomial in max* =1 \xi\ + logt, that is as long as s(x) is a polynomial in 
max* =1 \xi | + log t. □ 

Now, notice that TestCover(n — k, k) is the s-dual problem of TestCover(/j, k), with s(Q) = n. 
Notice also that in Theorem [21 s(Q*) = n + 21 ■ (k + 1) + I, which is a polynomial in n + I < 
max* =1 \Qi\ + logt. Hence, the hypothesis of Theorem 0] holds and there is no polynomial kernel 
for TestCover(n — k,k). Thus, we have the following: 

Theorem 5. The problem TestCover(n — k, k) does not admit a polynomial kernel, unless NP C 
coNP/poly. 

4 Open Problems 

In Corollaries Q] and [2] we give sufficient conditions for the existence of kernels of polynomial 
size and polynomial in the number of vertices, respectively. It would be interesting to obtain 
extensions of the corollaries. 
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